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1 


atggcggccg 


aagcctcgga 


gagcgggcca 


gcgctgcatg 


agctcatgcg 


cgaggcggag 


61 


atcagcctgc tcgagtgcaa ggtg'tgcttt 


gagaagtttg 


gccaccggca 


gcagcggcgc 


121 


ccgcgcaacc tgtcctgcgg ccacgtggtc 


tgcctggcct 


gcgtggccgc 


cctggcgcac 


181 


ccgcgcactc 


tggccctcga 


gtgcccattc 


•tgcaggcgag 


cttgccgggg 


ctgcgacacc 


241 


agcgactgcc 


tgccggtgct 


gcacctcata 


gagctcctgg. 


gctcagcgct- 


tcgccagtcc 


301 


ccggccgccc 


atcgcgccgc 


ccccagcgcc 


cccggagccc 


tcacctgcca 


ccacaccttc 


361 


ggcggctggg 


ggaccctggt 


caaccccacc 


ggactggcgc 


tttgtcccaa 


gacggggcgt 


421 


gtcgtggtgg 


tgcacgacgg 


caggaggcgt 


gtcaagattt 


ttgactcagg gggaggatgc 


481 


gcgcatcagt 


ttggagagaa 


gggggacgct 


gcccaagaca 


ttaggtaccc tgtggatgtc 


541 


accatcacca 


acgactgcca 


tgtggttgtc 


actgacgccg 


gcgatcgctc 


catcaaagtg 


601 tttgattttt 


ttggccagat 


caagcttgtc 


attggaggcc 


aattctcctt* 


accttggggt 


661' gtggagacca 


cccctcagaa 


tgggattgtg 


gtaactgatg 


cggaggcagg 


gtccctgcac 


721 


ctcctggacg 


tcgacttcgc 


ggaaggggtc 


cttcggagaa 


ctgaaaggtt 


gcaagctcat 


781 


ctgtgcaatc 


cccgaggggt 


ggcagtgtct 


tggctcaccg 


gggccattgc 


ggtcctggag 


841 


caccccctgg 


ccctggggac tggggtttgc 


agcaccaggg 


tgaaagtgtt tagctcaagt 


901 atgcagcttg tcggccaagt ggataccttt 


gggctgagcc 


tctactttcc 


ctccaaaata 


961 


actgcctccg 


ctgtgacctt 


tgatcaccag 


ggaaatg^ga 


ttgttgcaga 


tacatctggt 


102.1 


ccagctatcc 


tttgcttagg 


aaaacctgag 


gagtttccag 


taccgaagcc 


catggtcact 


1081 


.catggtcttt 


cgcatcctgt 


ggctcttacc 


ttcaccaagg 


agaattctct tcttgtgctg 


1141 


gacacagcat- 


ctcattctat 


aaaagtctat 


aaagttgact 


gggggtgatg ggctggggtg 


1201 


ggtccctgga 


atcagaagca 


ctagtgctgc 


cattaa'tgaa 


ttgtttaacc 


ctggataagt 


1261 


cacttaaact: 


catctatcca 


ggcagggata 


attaaaacca 


tctggcagac ttacaaagct 


1321 


tgggacagtt 


attggagatt 


aatctaccat 


ttattgaatg 


catactctgt 


gcaaggaaat 


1381 


ttgcaaatat 


tagcttattt aatctgtact 


atccagtgag 


gtaatttctt 


cccccccaag 


1441 


atagagtcaa 


gctctgtcac ccaggctgga 


gtgcagaagc 


atgatcacag 


ctcactacag 



SEQ- It) ua- \ 



Appl'nNo.: 10/567,074 
Title: Lafora's Disease Gene 
Inventors: Stephen W. Scherer, et al. 
wo 2005/01 Annotated Sheet 



i/001449 



8/15 



Fig6B 



EPM2B protein sequence 

MAAEASESGPALHELMREAmSLLECKVCFEKFGMRQQRRPRNLSCGm^ 

CXACrVAALAHPRTLALECyFauiACRGGI)TSDCIJ>VLHLIELLGSALRQS 

PAAHRAAPSAPGALTCHHTFGGWGTLVNPTGLALCPKTaR\'WVfIDGRRR 

VKIFDSGGGCAHQFGEKGDAAQDmYPVDVTITMDCHVVVTDAGDRSIKV 

FDFFGQIKLVIGGQFSLPWGVETTPQNGIVVTDAEAGSLHLLDVDFAEGV 

I^TERLQAffl,CKPRGVAVS\^^TGAIAVLEHPLALGTGVCSTRVKVFSSS 

MQLVGQVDTFGLSLYFPSKITASAVTFDHQGNVIVADTSGPAILCLGKPE 

EFPWKPMVTHGI^HPVALTFTKINSIXVLDTASHSIKVYKVDWG 



Appl'nNo.: 10/567,074 
Title: Lafora's Disease Gene 
Inventors: Stephen W. Scherer, et al. 
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Promoter C5') sequence: 

1 CCCCAAGGCC CCCCCGGCCC 

• 51 AGGCCCCCCG GCCCCAAGCC 

101 CCCCGGCCCC CCGCCCCCAG 

151 CAAGCACCCA GCCCCAGCAC 

201 CCCAGCCCCC GCCCCAGCAC 

251 CCCAGCCCCC GTCCCCCCCC 

301 ACCCAGCAGG GGACTGCAAA 

351 TCTAGTTTTG CrTTGCCGTT 

401 GAGCCTGTTT CCCGTCGCGG 

451 CTGCCTGAAG GTCACGGGCC 

501 GCGTCCGCTC CCGCGCCCTC 

551 ACCGCAGGCC GCGGCCGA6A 

601 CCGCCCCGCC CCGCCCCGCC 

651 CCG6CCCCGG ACCGAGCGGC 
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CCAGGCAACC CCAGGCCCCC AGGCAACCCA 
CCCCAGGTTC CCGGCCCCAA GAACCAAGCC 
acCCAGCAC CAAGCCCCCG CCCCCCGCCC 
CCAGCCCCCG CCCCAGCCCC AGCCCCAGCA 
CCAGCCCCAG CACCCAGCCC CCGCCCCAGC 
CCAGCACCCA GCCCCAGCCC CAGCAGCAGC- 
GCGTAGGCTA CCCCAGGTGG AACACCGTGT 
TGCAGCCTGG GCGATC6GGG GCCACCGCTC 
AAAGCGGAGC CGCCCCGCCC CGCCCCCCGC 
TGGGCaGCG GCGC6CGGTG CGGCCCGCGA 
CCCACTCAGC GCCCGCCCGC CCGCCGGGGG 
GGCTGCGCGC TGCGCCCGCG ACGTCAGGCC 
CC6TGACCCG CCCCGGCCCC 66CCCCGGCC 
GCCCGCGGGA GC6GCGGCGG CCGCGCG 



Coding sequence: 

ATG 

701 GGGGCCGAAG CGGCGGGGAG CGGGCGGGCG aCCGGGAGC TGGTGCGCGA 

751 GGCCGAGGTC AGCTTGCTCG AGTGCAAGGT GTGCTTCGAG AGGTTCG6CC 

801 ACCGCCAGa GCGGCGCCCG CGCAACcTGC CCTGCGGCCA CGTGGTGTGC 

851 CTGGCCTGCG TGGCGGCCCT GGCGCACCCG CGGACGCTGG CCCTGGA6TG 

901 CCCCTTCTGC CGCCGGGCCT GCCGCGGCTG CGACACCAGC 6ACTGCCTGC 

951 CGGTGCTTCA CCTCCTG6AG CTCCTGGGCT CGGCGCTGCG CCCAGCCCCC 

1001 GCCGCCCCCC 6CGCCGCCCC CCGCCCCGCC CCCTGCGCCC CGGGCGCCCT 

1051 CGCCTGCCAT CACGCGTTCG GAG6CTGGGG GACCCTGGTC AACCCCACGG 

1101 GGCTGGCGCT 6T6CCCCAAG ACCGGGCGGG TCGTGGTGGT GCACGACGGC 

1151 AGGAGGCGGG TCAAGATCTT TGACTCCGGG GGAGGATGCG CCCATCAGTT 

1201 TGGAGAGAAG 6GGGAGGCTG CCCAGGACAT TAGGTACCCC CTGGACGTCG 

1251 CCGTCACCAA CGACTGCCAC GTGGTTGTCA CCGACGCCGG CGACCGCTCC 

1301 ATCAAAGTGT TTGATTTCTT TG6CCAGATC AAGCTCGTCA TTGGAGACCA 

1351 GnTTCGTA CQTGGGGCG TGGAGACCAC CCCTCAGAAT GGGGTCGTGG 

1401 TAAaGACGC CGAGGCAGGG TCGCTGCACC TGCTGGAAGT CGACTTTGCA 

1451 GAAGGAGCCC TCCAGAGGAC TGAAAAGCTG CAAGGTCATC TGTGCAACCC 

1501 GCGAGGGGTG GCCGTGTCa GGCTCACTGG GGCCATTGCG GTCCTGGAGC 

1551 ACCCTCCGGG GCTGGGGGCT GGGGCGGGCA GCACCGCCGT GAAG6TGTTC 

1601 AGCCCA ACTA TGCAGCTGAT CGGCCAGGTG GATACCTTTG GGCTCAGCCT 

1651 CTTTTTCCCC TCTAGAATAA CCGCCTCCCC CGTGACCTTT GATCACCAGG 

1701 GGAATGTGAT TGTTGCAGAT ACTTCTAGTrC AGGCCGTCCT ATGCTTGGGA 

1751 CAGCaGAGG AATTTCCAGT CaGAAGCCO ATCATCACCC ATGGTCTTTC 

1801 CCATCCTGTG GCACTGACCT TCACCAAGGA GAATTCTCTT CTTGTGCTGG 

1851 ACAGTGCAGC CCATTCCGTA AAAGTCTACA AGCCTGACTG GG66TAA 



wo 2«M)r 
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Met Gly Ala Glu Ala Ala Gly Ser Gly Arg Ala Leu Arg Glu Leu Val 
1 ' 5 10 15 

Airg Glu .Ala Glu Val Ser Leu Leu Glu Cys Lys Val Cys Phe Glu Arg 
20 25 30 

Phe Gly His Arg Gin Gin Arg Arg Pro Arg Asn Leu Pro Cys Gly His 
35 40 45 

Val Val Cys Leu Ala Cys Val Ala Ala Leu Ala His Pro Arg Thr Leu 
50 55 60 



Ala Leu Gill Cys Pro Phe Cys Arg Arg Ala Cys Arg Gly Cys Asp Thr 
65 70 . 75 80 

Ser Asp Cys Leu Pro Val Leu His Leu Leu Glu Leu Leu Gly ser Ala 
B5 90 95 



Leu Arg Pro Ala Pro Ala Ala Pro Arg Ala Ala Pro Arg Ala Ala Pro 
•100 105 110 



Cys Ala Pro Gly Ala Leu Ala Cys His His Ala Phe Gly Gly Trp Gly 
115 120 125 

Thr Leu val Asn Pro Thr Gly Leu Ala Leu Cys Pro Lys Thr Gly Arg 
130 135 140 



Val Val val Val His Asp Gly Arg Arg Arg Val Lys lie Phe Asp Ser 
i45 150 155 160 

Gly Gly Gly Cys Ala His Gin Phe Gly Glu Lys Gly Glu Ala Ala Gin' 
165 170 175 

Asp He Arg Tyr Pro Leu Asp Val Ala Val Thr Asn Asp Cys His Val 
180 185 190 

Val Val Thr Asp Ala Gly Asp Arg ser He Lys Val Phe Asp Phe Phe 
195 200 205 



Gly Gin He Lys Leu Val He Gly Asp Gin Phe Ser Leu Pro Trp -Gly 
210 215 220 
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Towards maltn's 
^RING finger 

LRP APAAPRAA 



• CAtiis fanilieris 



Mus tmstmlaa CTCCACGOGTCCCCG*- 
Rattvs, xiorve^ictt? CTCCACSCGTCCCCG— 



Towanb malin's 
NKLd 



R. A A P C A 
ICCCCTGCGCC 




Sm sero/a (Pig) CTTCGCCaSGCGCCC- 

Somo sapiens ■ CTTCGCCAGTCCCCG 

Pan croglodytas CTTCCCCWSTCCCCG — 

(Chimpanxae) 



mCGCG. 
"GCrSCCCrCAgCGCCGCCHXTGTGCS 
-GCGGCCTCCCGCGGCGCCCCCTCCTCC 

CCAGCGCC 



W M 



Non-deaminatcd 
DNA 




Deaminated DNA 




Fells cstur" 
Panch&ra 2eo (Lion) 
PanthBra ti^rls {Tiger} 
Panthera pjjxftjs- (Leopard) 



AGCCCCC' 



ACCCCCC- 
AGCCCCC- 
AGCCCCC- 

Psitthsra vr.lcla (Snow Leopard) AGCCCCC 



AGCCCCC- 
AGCCCCC- 



AGCCCCC — 
AGCCCCCGCCGC( 
AGCCCCC- 



/ Canoidae / 


Arctoidac 




. r T 

60Ma 50Ma 


— 1 

lOMa 



Aclnonyx jvbacus (Cheetah) 
Lynx caxflcal (Lyxnc) 

Oasis £ami2iMSia 

Csnis lupus Dingo 

Cams lupus (Grey Wolf) 

Canls rafua (Red Kolf) 

CanXs Istzms (Coyote t 

Canis aureus (Golden Jaclcal} 

Cuon alpinus (Dholo) 
Dusicyon gri^eue (Grey Ton) 

Urocyon ll^taralia (IsX. Ftax) 

OmT^pacus aealstrlBZns (skunk) 
Ursus aamrScanus' (Black Bear) 



Ursvs maritimjs (Polar Boar) 
tirsus mrczos (Erovoi Bear) 

ASCQCCC- 

Vraus AjTCtos hoxrlbJlla (Grizzly) 
Procyoaiciae Jo tor (Raccoon] AGSCCCC- 
Patas £laws (Kin)ca:)ou} AG:;cccc< 
Bessaricyon ijoodari JBeddard) ACGCCCC- 
fjasua naava (Ring Called Coati 1 A6;;C0CC- 
Gulo ffuio (Meiverino) AC3CCCC- 
Gallctis vitzata (Grison) AGiJCCCC- 




-GCCGCCCACCGCGCCGCCCCciGreS 



CCCCCCGCGCCGCCCCCCGCGCCGCCCCCrOOGCC 
— - — -GCCGCCCOCCGCGCCGCCCCCTQXCC 



GCCGCCCCCCGCGCCGCCCCCTGeGCC 

C(3CGCC6CCCCCCGC<^CGCCCCCTGSGCC 
GCCGCCCCCCGCGCCGCCCCCTGeGCC 




ZCCCCTCCGCC 

AGCCCCCGCCGCC:CCCCGCGCCGCCCCCCGCGCC6CCCCCTGCGCC 

:cco 

AGCCCCC— 
AGCCCCC— 




Uuatela viaoi: (Americas Kink) AGGCCCC- 
Martes pennantl (Flaher) ACrcccc^ 
Lucxa canaOlens (Octer) AGGCCCC- 
L. aacuiieollis (Sported^neck: otter) c— 
MelOQale BKischata — AGiSOCCC— 
(Chinese rerxet Badger) 



GCCGCCCCCCQCGCCGCSCCCieOGCC 

GCCGCCCCCCGCGCCGCGCCCTGOSCC 

GCC6CCCCCCGCGCCGTi3CCCTGCBCC- 

(SJCCCCCCCCCCGCCGCGCCGTCCSCC 

CC^OCCCu'roCTGCCCC 
luCCGTGCGCC 
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1 atggcggccg aagcctcgga gagcgggcca gcgctgcatg agctcatgcg cgaggcggag 



121 


ccgcgcaacc 


tgtcctgcgg 


ccacgtggtc 


tgcctggcct 


gcgtggccgc 


cctggcgcac 


IBl 


ccgcgcactc 


tggccctcga 


gtgcccattc 


tgcaagcgag 


cttgccgggg 


ctgcgacacc 


241 


agcgactgcc 


tgccggtgct 


gcacctcata 


■gagctcctcq 


gctcagcgct 


tcgccagtcc 


301 


ccggccgccc 


atcgcgccgc 


ccccagcgcc" 


ccccgag'ccc 


tcacctgcca 


ccacaccttc 


361 


ggcggctggg 


ggaccctggt 


caaccccacc 


ggactggcgc 


ttt.gtcccaa 


gacggggcgt 


421 


gtcgtggtgg 


tgcacgacgg 


caggaggcgt. 


gtcaagattt 


ttgactcagg gggaggatgc 


481- 


gcgcatcagt 


ttggagagaa 


gggggacgct 


gcccaagaca 


ttaggtaccc tgtggatgtc 


541 


accatcacca 


acgactgcca 


tgtggttgtc 


actgacgccg 


gcgatcgctc 


catcaaagtg 


601 


tttgattttt 


ttggccagat 


caagcttgtc 


attggaggcc 


aattctcctt 


accttggggt 


661 


gtggagacca 


cccctcagaa 


tgggattgtg 


gtaactgatg 


cggaggcagg 


gtccctgcac 


721 


ctcctggacg 


tcgacttcgc 


ggaaggggtc 


cttcggagaa 


ctgaaaggtt 


gcaagctcat 


781 


c L.crtrrpaa'h r* 






■f- pf 4- !a 

u y ^ (.» u w<3 t^w^ 


gggccattgc 


ggtcctggag 


841 


caccccctgg 


ccctggggac 


tggggtttgc 


agcaccaggg 


tgaaagtgtt 


tagctcaagt 


901 


atgcagcttg 


tcggccaagt 


ggataccttt 


gggctgagcc 


tctactttcc 


ctccaaaata 


961 


actgcctccg 


ctgtgacctt 


tgatcaccag 


ggaaatgt-ga 


ttgttgcaga 


tacatctggt 


1021 


ccagctatcc 


tttgcttagg 


aaaacctgag 


gagtttccag 


taccgaagcc catggtcact 


1081 


catggtcttt 


cgcatcctgt 


ggctcttacc 


ttcaccaagg 


agaattctct tcttgtgctg 


1141 


gacacagcat 


ctcattctat 


aaaagtctfft 


aaagttgact 


gggggtgatg ggctggggtg 


1201 


ggtccctgga 


atcagaagca 


ctagtgctgc 


cattaatgaa 


ttgtttaacc 


ctggataagt 


1261 


cacttaaaCk 


catctatcca 


ggcagggata 


attaaaacca 


tctggcagac ttacaaagct 


1321 


tgggacagtt 


attggagatt 


aatctaccat 


ttattgaatg 


catactctgt 


gcaaggaaat 


1381 


ttgcaaatat 


tagcttattt 


aatctgtact 


atccagtgag 


gtaatttctt 


cccccccaag 


1441 


atagagt^caa 


gctctgtcac 


ccaggctgga 


gtgcagaagc 


atgatcacag ctcactacag 



61 



atcagcctgc 



tcgagtgcaa ggtgtgcttt gagaagtttg 



gccaccggca gcagcggcgc 



SEQ ID NO: 1 
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EPM2B protein sequence 

MAAEASESGPALHELMREAEISLLECKVCFEKFGHRQQRRPRNLSCGH\^ 

(XACVAALAHPRTLAI£CPFaUlACRGCI)TSDCIJ>V^ 

PAAHRAAPSAPGALTCHHTFGGWGTLVNPTGLALCPKTGRWWHDGRRR 

VKIFDSGGGCAHQFGEKGDAAQDmTVDVTITNDCHVVVTDAGDRSIKV 

FDFFGQIKLVIGGQFSLPWGVETTPQNGIVVTDAEAGSIilLLDVDFAEGV 

LKRTERLQAHLQvIPRGVAVSWLTGAIAVLEHPLALGtGVCSTRVKVFSSS 

MQLVGQVDTFGLSLYFPSKITASAVTFDHQGNVrVADTSGPAILCLGKPE 

EFPWKP]vrVTHGLSHPVALTFTKENSLL\nj:>TASHSIKVYKVDWG. 



SEQ ID NO: 2 
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Promoter C5') sequence: 



*r^^55*^^ CCCCCGGCCC CCAGGCAACC CCAGGCCCCC AGGCAACCCA 

.1^ ^55^5^^^^^ GCCCCAAGCC CCCCAGGTTC CCGGCCCCAA GAACCAAGCC 

' It^ r^^S?^^^^^ CCGCCCCCAG CACCCAGCAC CAAGCCCCCG CCCCCCGCCC 

151 CAAGCACCCA GCCCCAGCAC CCAGCCCCCG CCCCAGCCCC AGCCCCAGCA 

201 CCCAGCCCCC GCCCCAGCAC CCAGCCCCAG CACCCAGCCC CCGCCCCAGC 

251 CCCAGCCCCC GTCCCCCCCC CCAGCACCCA GCCCCAGCCC CAGCAGCAGC 

301 ACCCAGCAGG GGACTGCAAA GCGTAGGCTA CCCCAGGTGG AACACCGTGT 

351 TaAGTTTTG CTTTGCCGTT TGCAGCCTGG GC6ATCGGGG GCCACCGCTC 

401 GAGCaCTTT CCCGTCGCGG AAAGCGGAGC CGCCCCGCCC CGCCCCCCGC 

451 CTGCCTGAAG GTCACGGGCC TGGGCCTGCG GCGCGCGGTG CGGCCCGCGA 

501 GCGTCCGCTC CCGCGCCCTC CGCAGTCAGC GCCCGCCCGC CCGCCGGGGG 

551 ACC6CAGGCC GCGGCCGAGA GGCTGCGCGC TGCGCCCGCG ACGTCAGGCC 

601 CCGCCCCGCC CCGCCCCGCC CCGTGACCGG CCCCGGCCCC GGCCCCGGCC 

651 CCG6CCCC6G ACCGAGCGGC GCCCGCGGGA GC6GCGGCGG CCGCGCG 



449 



Coding sequence: 

ATG . 

701 GGGGCCGAAG CGGCGGGGAG 

751 GGCCGAG6TC AGCTTGCTCG 

801 ACCGCCAGCA GCGGCGCCCG 

851 CTGGCCTGCG TGGCGGCCCT 

901 CCCCTTCTGC CGCCGGGCCT 

951 CGGTGCTTCA CaCCTCGAG 

1001 GCCGCCCCCC GCGCCGCCCC 

1051 CGCCTGCCAT CACGCGTTCG 

U01 G6CTGGCGCT 6TGCCCCAAG 

1151 AGGAGGCGGG TCAA6ATCTT 

1201 TGGAGAGAAG GGGGAGGCTG 

1251 CCGTCACCAA CGACTGCCAC 

1301 ATCAA AGTGT TTGATTTCTT 

1351 G 1 i i I CCTTA CCTTGGGGCG 

1401 TAACTGACGC CGAGGCAGGG 

1451 GAAGGAGCCC TCCAGAGGAC 

1501 GCGAGGGGTG GCCGTGTCa 

1551 ACCCTCCGGG GCTGGGGGCT 

1601 AGCCCAACTA TGCAGCTGAT 

1651 CTTTTTCCCC TCTAGAATAA 

1701 GGAATGTGAT TGTTGCAGAT 

1751 CAGCCTGAGG AATTTCCAGT 

1801 CCATCCTGTG GCACTGACCT 

1851 ACA6TGCA6C CCATTCCGTA 



CGG6CGGGCG CTGCGGGAGC TGGT6CGCGA 
AGTGCAAGGT GTGCTTCGAG A6GTTCG6CC 
CGCAACcTGC CCTGCGGCCA CGTGGTGTGC 
GGCGCACCCG CGGACGCTG6 CCCTGGA6TG 
GCCGCGGCTG CGACACCAGC GACTGCCTGC 
CTCaGGGCT CGGCGCTGCG CCCAGCCCCC 
CCGCGCC6CC CCCTGCGCCC CGGGCGCCCT 
GAGGCTGGGG GACCCTGGTC AACCCCACGG 
ACCGGGCGGG TCGTGGTGGT GCACGACGGC 
TGACTCCGGG GGAGGATGCG CCCATCAGTT 
CCCAGGACAT TAGGTACCCC CTGGACGTCG 
GTGGTTGnrCA CCGACGCCGG CGACCGCTCC 
TGGCCAGATC AAGCTCGTCA TTGGAGACCA 
TGGAGACCAC CCCTCAGAAT GGGGTCGTGG 
TCGCTGCACC TGCTGGAAGT CGACTTTGCA 
TGAAAAGCTG CAAGGTCATC TGTGCAACCC 
GGCTCACTGG GGCCATTGCG 6TCCTGGAGC 
GGGGCGGGCA GCACCGCCGT GAAG6TGTTC 
CGGCCAGGTG GATACCTTTG G6CTCAGCCT 
CCGCCTCCGC CGTGACCTTT GATCACCAGG 
ACTTCTACrC AGCCCGTCCT ATGCTTGGGA 
CaGAAGCCC- ATaiCACCC ATGGTCTTTC 
TCACCAAGGA GAATTCTCTr CTTGTGCTGG 
AAAGTCTACA AGGCTGACTG 6GGGTAA 



SEQ ID NO: 3 
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Met Gly Ala Glu Ala Ala Gly Ser Gly Arg Ala Leu Arg Glu Leu Val 
1*5 XO 



15 



Arg Glu Ala Glu Val Ser Leu Leu Glu Cys Lys Val Cys Phe Glu Arg 
20 ' 25 .30 

Phe Gly His Arg Gin Gin Arg Arg Pro Arg Asn Leu Pro Cys Gly His 
35 40 45 

Val Val Cys Leu Ala Cys Val Ala Ala Leu Ala His Pro Arg Thr Leu 
50 55 60 

Ala Leu Glu Cys Pro Phe Cys Arg Arg Ala Cys Arg Gly Cys Asp Thr 
€5 70 75 80 

Ser Asp Cys Leu Pro Val Leu His Leu Leu Glu Leu Leu Gly Ser Ala 
V B5 .90 95 

Leu Arg Pro Ala Pro Ala Ala Pro Arg Ala Ala Pro Arg Ala Ala Pro 
100 105 110 



cys Ala Pro Gly Ala Leu Ala Cys His His Ala Phe Gly Gly Trp Gly 
115 120 125 

Thr Leu Val Asn Pro Thr Gly Leu Ala Leu Cys Pro Lys Thr Gly Arg 
130 135 140 



Val Val Val Val His Asp Gly Arg Arg Arg Val Lys lie Phe Asp Ser 
145 150 155 160 

Gly Gly Gly Cys Ala His Gin Phe Gly Glu Lys Gly Glu Ala Ala Gin 
165 170 175 

Asp lie Arg Tyr Pro Leu Asp Val Ala val Thr Asn Asp Cys His Val 
180 185 190 

Val Val Thr Asp Ala Gly Asp Arg Ser lie Lys Val Phe Asp Phe Phe 
195 200 205 ■ 



Gly Gin He Lys Leu Val He Gly Asp Gin Phe Ser Leu Pro Trp Gly 
210 215 220 



SEQ ID NO: 4 
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Towards maJin's 
^RING finger 



Towaids main's 
NHL domains ^ 



W M C 



Mus aausenlus 




Non-deaminated 
DNA 



CTCCUCGCGTCCCCG — 
Raztus norVK^ieQ9 CTCCACSOGTCCCCG— 



-GCTGCCCTCAGaiCCGCCCCCTTCGCG 
— GCTGCCCT&G 



w c 



SOS scro/a (Pis) CTTCGCCC6GCGCCC 

Homo Sag>iens CTTCGCCAGTCCCCG- 

P^n troglodytms CTTCGCCACTCCCCG—-— 
(OhinpansAe} 



GCCGCCCMt:GCCCCGCC0CCA.6CGCC 
CCAGOGCC 




Deaminated DNA 




5DMa 



lOMa 



Fells catvs" 
PancherB 2bo (L>lon} 

Panznara zigrls (Tiger) AECCCCC- 
Panthers partiua- (Leopard} AGCCCCC 
Panthera vr.dcia (Snow laopareJ) AGCCtXC 
Ac jjionyx jubatua (Cheerah) 
Lynx caracal {Lyrat} 




AGCCCCC 
AGCCCCC 



■GCCGCOytCCGCGCCGCCCCCkGCGCC 



Can iff fnmili mris 

Csoas lupus Din^ 

Caals lupus (Grey Uoaf) 

CarIs ratuo (Rsd Holf) 

Canis latzxns (Coyote t 

Ca/iis aureus '(Golden jaclcBl} 

Cuofi alpinus (Ohole) 
Dusicyon gu:i«eu« (Grey Pox) 

Uroeyon lititoralls (Isl. Pox) 

Conqpattrs seial stria tax (Skimie) 
Ursus aamricBtms (Black Bear) 

Urstts aaritimus (Polar Boar) 
tirsus aretos (Brown Boar) 



Uraua arctoa horrlbSlla (Grizzly) CCCC' 
ProeyoDidae Jotor (Raccoon) AG3CCCC- 
Potos Haws (KinJcajou) AG?;cccC' 
Bessaricyon JyeOdari (BeOOard) AGGCCCC- 
Masui aaava (Riagcalled Coati ) AGf^CCCC- 
Guio gulo (Holverino) AGGCCCC' 
GaLictls victata (Griaon) AG^fCCCC- 
Uuatela vison (American Kink} AG^CCCC- 
Mactes pennajiti (Tiahox) AGTCCCC- 
Xutra caaatHena (Otter) AGGCCCC* 
L. aacullcDllia ( Sport ed-aeek otter) C< 
Melo^ale moschata - MBmoOZZ' 
(Chinese Ferret Badger) 



AGCCCrCSCOGCCCCCCGCeCCGCCCCCCGCGCCGCCCCCrGCGCC 

AGCCCCC — CCCCCCCCCCCCGCCGCCarCTGOCCC 

AGCCCCCSCCGCCCCCCGCGCCGCCCCCCgCGCCGCCCCCTCCGCC 

AGCCCCCSCCGCCCCCCGCGCCGCCCCCCGCGCCGCCCCCTGCGCC 

AGCCCCC — -GCCCCCCCCCCCGCCGCCCCCrCCGCC 

AGCCCUaja:bCCUCCCGCGCC SU.LJLU.Jb CGCCTCCCCCTGCroC 

AGCCCCC GCCGCCCCOGGCGCCGCCCCCTGCSCC 

AGCCCCCGCCECCCCCCGCGCCGCCCCCCGCGCCCCCCCCTGCCCC 

AGCCSXr ^— GCCGCCCCCCCCCCCCCCCCCTGCGCC 

AGCCCCCGCC8CCCCCCGCGCCC0CCCCCGCGCCGCCCCCTGCGCC 
AGCCCCC---—— — ~GCCGCCCCCCCCGCCGCax:crCCGCC 
AGCCCCCGCraCCCCCCGCGCCGCCCCCCGCGCCGCCCCCTGCGCC 

:CCCCT GCGCC 
^mCCCCCCTGOSCC 

Asooccc— ■■ ■■ i -cccsocecoascsccGccccu w s Ga : 




'GCC<SCCCCCT(SCACC 
mGCGCCGCT COCTG06CC 



; C CCC C C<»CCC GC CCT(»CGCC 
CCCT(S06CC 




-HaOCGCCCCCCSCOCCGCJCOGTGCGOC 



Fig 10 



SEQ ID NOS: 6-52 



